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Abstract. [Purpose] Intelligent emotion assessment systems have been highly successful in a variety of appli- 
cations, such as e-learning, psychology, and psycho-physiology. This study aimed to assess five different human 
emotions (happiness, disgust, fear, sadness, and neutral) using heart rate variability (HRV) signals derived from 
an electrocardiogram (ECG). [Subjects] Twenty healthy university students (10 males and 10 females) with a mean 
age of 23 years participated in this experiment. [Methods] All five emotions were induced by audio-visual stimuli 
(video clips). ECG signals were acquired using 3 electrodes and were preprocessed using a Butterworth 3rd order 
filter to remove noise and baseline wander. The Pan-Tompkins algorithm was used to derive the HRV signals from 
ECG. Discrete wavelet transform (DWT) was used to extract statistical features from the HRV signals using four 
wavelet functions: Daubechies6 (db6), Daubechies7 (db7), Symmlet8 (sym8), and Coiflet5 (coif5). The k-nearest 
neighbor (KNN) and linear discriminant analysis (LDA) were used to map the statistical features into correspond- 
ing emotions. [Results] KNN provided the maximum average emotion classification rate compared to LDA for five 
emotions (sadness - 50.28%; happiness - 79.03%; fear - 77.78%; disgust - 88.69%; and neutral - 78.34%). [Con- 
clusion] The results of this study indicate that HRV may be a reliable indicator of changes in the emotional state 
of subjects and provides an approach to the development of a real-time emotion assessment system with a higher 
reliability than other systems. 
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INTRODUCTION 

Emotion is essential for our daily interaction with people 
and even with computers. Several studies have addressed 
the subject of emotions and their role in the development of 
Human-Computer Interaction (HCI) and Brain Computer 
Interface (BCI) 1 ' 2 '. Emotion directly affects our decision 
making, perception, cognition, creativity, attention, rea- 
soning, and memory 3 '. Several studies have reported using 
facial expression, speech, and gestures for assessing emo- 
tions. However, in these studies, the emotions were easily 
mimicked by other subjects and did not reflect the inher- 
ent emotional state of the subjects 4 '. In recent years, physi- 
ological signals measured by electrocardiography (ECG), 
electromyography (EMG), galvanic skin response (GSR), 
electroencephalography (EEG), respiration rate (RR), and 
other methods have been used to assess changes in subjects' 
emotional states in a more reliable and non-invasive man- 
ner 2, 4 - 9 '. Physiological signals reflect the inherent changes 
in physiological activities of the subject under different 
emotional states, and a subject cannot control these activi- 
ties related to his/her emotional states. Studies often use 
three different methods to model emotions: valence (un- 
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pleasant or negative to pleasant or positive), arousal (drowsy 
or peaceful to excited or alert), and discrete mode (happi- 
ness, sadness, fear, anger, surprise, and disgust) 7 - 10 ' u '. 
Most studies have used a valence-arousal-based emotional 
assessment using physiological signals because of its easier 
protocol design and simple signal processing methods. The 
mapping of discrete mode emotions in the valance -arousal 
model is very challenging because it is possible to obtain 
overlap of different emotions and multiple emotional expe- 
riences arising from the same stimuli. 

Emotion research to date has mainly focused on the 
field of psychology, yet the mechanisms of psychology and 
psycho-physiology span across numerous other disciplines, 
such as physiology, psycho-physiology, and others 2 '. The 
challenge of an interdisciplinary research area is to stan- 
dardize a common vocabulary and to develop the research 
framework that a mature discipline requires. However, there 
are several major limitations to this kind research work, in- 
cluding: (i) How to induce and measure the emotions? (ii) 
How to remove the effects of noises and artefacts from the 
physiological signals? and (iii) How do we distinguish dif- 
ferent emotions based on heart rate variability (HRV) sig- 
nal characteristics? 

A physiological reaction (activation or arousal; e.g., in- 
crease in heart rate) is a change in activity in the autonomic 
nervous system (ANS) that accompanies emotions. During 
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positive (happiness, surprise) and negative emotions (sad- 
ness, fear, anger, and disgust), significant changes occur in 
the characteristics of the low frequency (LF; 0.03-0.12 Hz) 
and high frequency (HF; 0.12-0.488 Hz) bands of HRV sig- 
nals 12 ' 13 '. Emotions show a range of physiological manifes- 
tations that can be measured with a diverse array of tech- 
niques. Herbelin et al. used five physiological signals to 
assess emotions, including skin conductivity level, EMG, 
skin temperature (ST), breathing frequency, and pulse rate 
(PR) 10 '. Takahashi K et al. used EEG, electrooculography 
(EoG), EMG, pulse oximetry, and skin conductance to 
measure different emotions, including joy, anger, sadness, 
happiness, and relaxation 4 '. In addition, they collected EEG 
signals from 64 electrodes over the entire scalp and used 
them to assess five different emotions achieving a maxi- 
mum mean classification rate of 88.9% using the k-nearest 
neighbour (KNN) classifier 4 '. By using ECG, Jing et al. 
used discrete wavelet transform (DWT) and KNN to clas- 
sify emotions and achieved a maximum average classifica- 
tion rate of 85.78% 8 '. 

Emotion assessment using multiple physiological signals 
usually increases the computational complexity (computa- 
tional time and processor memory requirement) and limits 
the subject's freedom (movements) during the experiment. 
Moreover, no study to date has performed a frequency 
analysis of HRV signals for discrete emotion classifica- 
tion. Therefore, this study aimed to analyze the different 
frequency ranges of HRV signals in order to efficiently clas- 
sifies emotions. We used audio-visual stimuli (video clips) 
to evoke five different emotions (happiness, disgust, fear, 
surprise, neutral). A set of statistical features were derived 
using DWT over two different frequency bands (LF: 0.03- 
0.12 Hz; and HF: 0.12-0.488 Hz) extracted from the HRV 
signal. The statistical features were extracted using the fol- 
lowing four wavelet functions: db6, db7, sym8, and coif5. 
These features were classified using two simple classifiers, 
namely KNN and linear discriminant analysis (LDA). Fi- 
nally, we compared the classification rates of these two dif- 
ferent classifiers over different wavelet functions. 

SUBJECTS AND METHODS 

This work began with data acquisition followed by pre- 
processing, feature extraction, and emotion classification. 

In emotion assessment research, different types of stim- 
uli for inducing emotions must be considered, such as audio 
(music clips/songs), visual (pictures/images), audio-visual 
(film clips/video clips), and emotional recall 5, 6 '. Many stud- 
ies have used audio-visual stimuli of shorter or longer time 
durations in order to induce discrete emotions 10 ' "' I4, 15 '. In 
a study by Li et al., a "Tom and Jerry" cartoon video was 
used to induce the emotion of joy in subjects 5 '. In this work, 
we used audio-visual stimuli (video clips) to induce five dif- 
ferent emotions. A total of 50 video clips were collected 
for five different emotions and each video clip had different 
time duration. 

In this study, we conducted 10 trials to induce each emo- 
tion, with each trial presenting five different emotion video 
clips. Before the video clips were played, some instructions 



Instructions Emotion Inducing Clips 




Natural scenes (Relax mode) 



10 IS XI X2 X3 X4 XS 
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Fea: Fear Sad: Sad Neu: Neutral Ois: Disgust Hap: Happy 



Fig. 1. Audio-visual stimuli based data acquisi- 
tion protocol of first trial 



were given to the subjects in order to relaxed them. In be- 
tween every video (stimuli), two images of nature scenes, 
such as hills, skies, ocean, or mountains, were displayed 
for 8 s in duration. The presentation of scenery was used to 
avoid feedback from previous emotional stimuli delivered 
to the subject before continuing to the next stimulus presen- 
tation 5 '. Figure 1 shows the protocol design for the first trial 
of this experiment. The time duration for each video varied. 
XI to X5 denotes the time period of the emotional stimuli, 
and the maximum and minimum time periods of the emo- 
tional stimuli were 60 s and 30 s, respectively. Some of the 
emotional stimuli were obtained from the Department of 
Psychology at Stanford University. All of the emotional 
clips for the remaining trials were arranged in a random 
manner over the complete protocol. 

ECG signals were collected from 20 university students 
(10 male and 10 female) by 3 electrodes. All participants 
were in good health and had a mean age of 23 years. An AD 
Instrument was used to acquire the ECG signals using three 
electrodes at a sampling frequency of 1,000 Hz. The two 
active electrodes were placed on the left and right wrists 
and the reference electrode was placed on the right ankle 
according to the Einthoven triangle 7 '. Stimuli were shown 
to each subject on a Liquid Crystal Display (LCD) projector 
screen after placement of the ECG electrodes. After each 
trial of this experiment, the subjects were asked to complete 
a self-assessment form to specify the emotion experienced 
during each video clip as well as a rating of the intensity. 
The ECG signals were collected throughout this protocol 
without causing any discomfort to the subjects. 

ECG signals are mostly contaminated with different 
types of noise, such as power line frequency, mismatch of 
electrode impedance, wandering of the baseline signal, and 
motion artefacts. According to Chavan et al., the baseline 
wander and power line interference of 50 Hz or 60 Hz is 
reduced in ECG signals by using a 3rd order Butterworth 
filter 16 '. In a study by Zhang et al., a set of low-pass and 
high-pass filters were used to remove the baseline wander 
and power line frequency interference 17 '. In another study, 
a low pass filter was used to eliminate the peak noises from 
the ECG signals 11 '. In this present study, we used a 3rd order 
Butterworth filter to remove the effects of noise and base- 
line wander from the ECG signals with cut-off frequencies 
of 0.002 Hz and 100 Hz. The HRV signals were derived 
from the ECG signals using the Pan-Tompkins algorithm 37 '. 

Several feature extraction techniques have been pro- 
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Table 1. Statistical features used for emotion recognition and their descriptions 



Features 


Formula 


Description 






Measures the deviation of electrodes potential 


Standard deviation 




from its mean value over different emotional 




1 V k-1 


ECG signals. 


Power 




Measures the squares of the amplitude of 
ECG signal. 



j = level of wavelet decomposition; k = No. of wavelet coefficients, varies from 1 to N 



posed to extract statistical features from physiologi- 
cal signals, such as the Hilbert-Huang Transform 6 ' and 
DWT 10 ' 14, 17, 18 '. Min et al. used conventional features, 
such as mean, standard deviation, median, minimum, max- 
imum, maximum interval, maximum resolution, and spec- 
trum average of the PQRST waves derived from the ECG 
signal for classifying emotions 7 '. In this study, DWT was 
used to extract statistical features from HRV signals. DWT 
is a non-linear transform over the space L 2 (R) that gives a 
time-resolved description for a large variety of signals 19 '. 
Jing et al. reported that the db6 wavelet has even symme- 
try and has a shape similar to the QRS complex from an 
ECG signal 8 '. In addition, other wavelet functions have also 
been used, such as db7, sym8, and coif5, for decomposing 
the HRV signals to extract the LF and HF frequency bands 
which are used to assess emotion recognition 20 '. The coif5 
wavelet has also been used for R wave detection in ECG 
signals 21 '. Based on the literature, a group of 14 wavelet 
functions from three wavelet families (Daubechies, Coi- 
flets, and Symlets) are commonly used for decomposing 
HRV signals 22 '. 

In addition, several types of wavelet function have been 
investigated in HRV analyses 23-27 '. However, very few stud- 
ies have classified emotions based on HRV signals using 
DWT 23, 24 '. An initial set of analyses was conducted with 
the db6 mother wavelet function and then extended with the 
remaining three wavelet functions (db7, sym8, and coif '5) to 
make a performance comparison of the features extracted 
from HRV signals. These wavelet functions were chosen 
based on the resemblance of characteristics of wavelet co- 
efficients in the LF and HF bands with the mother wavelet 
function. Greater performance in feature extraction can be 
easily achieved if the wavelet coefficient characteristics are 
highly similar to the mother wavelet function characteris- 
tics. By using DWT, the single prototype function called the 
mother wavelet is used to decompose the input signal based 
on scaling and shifting parameters 28 '. The mother wavelet 
function fa, b (t) is given as: 

V,j,<t) = A=V(— ) (1) 

where a, b e R, a > 0, and R is the wavelet space. 

Parameters 'a' and 'b' are the scaling factor and the shift- 
ing factor, respectively, since choosing a prototype function 
as the mother wavelet should always satisfy the admissibil- 
ity condition (Equation 2), 



C = I J '-daxcc (2) 

L co 

where W (co) is the Fourier transform of Wa, b (t). 

First, a decomposition tree of the wavelet transform is 
constructed using low-pass and high-pass filters to derive 
the frequency sub-bands of the input signal. When the sig- 
nal is filtered repeatedly by a pair of digital filters (low-pass 
and high-pass filters) that cut out the frequency domain in 
the middle, the time-frequency representation is obtained. 
The filtered coefficient derived from the low-pass filter is 
termed the Approximation Coefficient (A) and the filtered 
output derived from the high-pass filter is called the De- 
tailed Coefficient (D). The approximation coefficient is 
subsequently divided into new approximations and detailed 
coefficients at the next level and so on. The number of it- 
erations or decomposition is purely dependent on the fre- 
quency range of interest in the input signal 28 '. 

In this study, we performed 14 levels of decomposition 
of input HRV signals to derive the wavelet coefficients and 
statistical features of the two frequency sub-bands (LF and 
HF) used for classifying emotions. In the literature, the LF 
band oscillations are reported to be in the range of 0.04- 
0.15 Hz and the HF band oscillations are reported to be in 
the range of 0.15-0.4 Hz 5 ' 9) . Therefore, the frequency range 
of HRV signals in LF and HF used in this study fell below 
the universal frequency range of the LF and HF bands. In 
general, very low frequency (VLF) (0.004-0.04 Hz) detail 
is not useful for extracting meaningful information regard- 
ing emotional changes based on HRV signals, and therefore 
it was not considered in this analysis. In HRV signal anal- 
ysis, most studies have considered the average frequency 
band power and standard deviation as the main statisti- 
cal features for several applications 29 '. Therefore, we also 
considered these features in classifying emotions. Table 1 
shows the description of statistical features and the mathe- 
matical formula for computation. The frequency bandwidth 
and corresponding decomposition level of the ECG signal is 
provided in Table 2. 

Several types of classifiers are used to classify emotions 
from physiological signal features, such as the artificial 
neural network, KNN, LDA, fuzzy clustering, multilayer 
perception neural network, and the support vector machine 
(SVM) 6 ' 7 - u ' 19 ' 30 '. Murugappan et al. used LDA and KNN 
to classify five emotions based on EEG signals 11 '. In another 
study, fuzzy-c-means (FCM) and fuzzy-k-means (FKM) 
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clustering methods were used to classify the four most 
dominant emotions 14 '. 

In this study we used LDA and KNN to classify the emo- 
tions based on HRV signal features. The KNN-based classi- 
fication is a very simple yet powerful classification method. 
The key idea behind KNN classification is that similar ob- 
servations belong to similar classes. Thus, one simply has 
to look for the class designators of a certain number of the 
nearest neighbors and weigh their class numbers in order 
to assign a class number to the unknown 30 '. In this study, 
a range of K-values (K = 2-5) were assessed and the re- 
sults were documented for comparison. The value of K at 
which the maximum classification accuracy is achieved is 
considered to be the optimal value of K for emotion classi- 
fication. LDA is also a classification method that is used to 
discriminate between two or more groups of samples. The 
group discrimination can be defined either naturally by the 
problem under investigation, or by some preceding analy- 
sis, such as a cluster analysis 31 '. The number of groups is not 
restricted to two, although the discrimination between two 
groups is the most common approach 31 '. Besides training 
and testing samples, LDA does not require any external pa- 
rameters for classifying discrete emotions 32 '. Researchers 
have proposed extended versions of LDA, such as pseudo- 
inverse LDA and Kalman Adaptive LDA, in order to avoid 
issues related to singularity problems and adjust the weights 
through an adaptive approach 33, 34 '. 

One statistical feature was extracted from each emotion 
detected in 20 subjects over 10 trials and then concatenated 
to form a feature vector of 1,000 samples in size. This fea- 
ture vector was split into two parts: the training and testing 
vectors. Seventy percent (700 out of 1,000) of samples in the 
feature vector were considered as the training set and the 
remaining 30% (300 out of 1,000) of samples were used in 
the testing set for this emotion classification. 

RESULTS 

Standard deviation and frequency band power were 
used to discriminate among the emotions extracted from 
HRV signals using LDA and KNN classifiers. Tables 3 and 
4 show the classification accuracy of emotions from HRV 
signal features using these classifiers. Among the differ- 
ent values of K (2 to 5), we achieved the maximum clas- 
sification accuracy with K = 5. Among the two different 
statistical features, standard deviation and the sum of aver- 
age power of the LF and HF bands generated the highest 
emotion classification rate compared to the other statisti- 
cal features. In our overall analysis, HF band features alone 
did not give a good classification accuracy for any of the 
emotions. However, the frequency range of both LF and HF 
bands (0.03-0.49 Hz) gave the maximum classification rate 
for distinguishing between the disgust and neutral emo- 
tions. Standard deviation analysis resulted in good discrim- 
ination between happiness and fear among participants. We 
essayed four different wavelet functions from the group of 
wavelet families to extract the emotional relevant features 
from HRV signals: db6, db7, sym8, and coif5. As shown in 
Table 4, coif'5 performed better that the other wavelet func- 



Table 2. Class distribution of the samples in the training and 
test data set for each feature 



Frequency Range 


Decomposition 


Frequency 


(Hz) 


Level 


Bands 


0.03-0.06 


D14 


Low frequency 


0.06-0.12 


D13 


bands 


0.12-0.24 


D12 


High frequency 


0.24-0.488 


Dll 


bands 


0.488-0.9766 


D10 




0.9766-1.953 


D9 




1.953-3.90625 


D8 




3.90625-7.8125 
7.8125-15.625 
15.625-31.25 
31.25-62.5 


D7 
D6 
D5 
D4 


Unused 
frequency 
bands 


62.5-125 


D3 




125-250 


D2 




250-500 


Dl 





tions for three emotions (sadness, happiness, and disgust), 
db7 performed the best for fear, and sym8 performed the 
best for the neutral emotion. Changes in emotional state 
based on the HRV signal are efficiently captured by us- 
ing the coif5 wavelet function because of its characteristic 
waveform matching and symmetry nature. This wavelet 
function gave the maximum classification rate of 88.89% 
for disgust, 79.03% for happiness, and 50.28% for sadness 
emotions. In addition, a classification rate of 78.34% for 
neutral emotions was achieved using the total frequency 
band power (LF + HF) derived from the symS wavelet func- 
tion. Therefore, selecting the appropriate wavelet functions 
for efficient emotion discrimination is very important in 
this type of research. 

Among the five different emotions, KNN performed bet- 
ter than LDA for four emotions: happiness, fear, disgust, 
and neutral. However, the maximum emotion classification 
rate of sadness was achieved using LDA (50.28%). Among 
the five different emotions, sadness had the lowest classifi- 
cation accuracy of 50.28% when using LDA and an extrac- 
tion of the coif '5 wavelet function. The audio-visual stimuli 
used to induce the sadness did not induce a strong emo- 
tional response in the subjects in this experiment. 

DISCUSSION 

In this study, most of the emotional features had a 
marked number of overlapping characteristics, therefore a 
linear boundary could not distinguish each emotion. Con- 
sequently, the classification rate of the LDA classifier in 
most of the classes had poor accuracy compared to KNN. 
In addition, the KNN classifier classifies emotions based 
on a voting scheme and the value of "K", Several studies 
have followed a 'trial and error' approach to choose the ap- 
propriate value of K, but few have determined the effective 
value of K through artificial intelligence approaches 34 '. The 
performance of KNN classification depends on the size of 
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Table 3. Averaged classification accuracy of emotions using KNN (K=5) 



Frequency 


Statistical 


Wavelet 






Emotions 






Average 


band 


Features 


Disgust 


Sad 


Happy 


Fear 


Neutral 


accuracy (%) 






db6 


74.59 


31.25 


74.72 


66.81 


74.87 


64.45 




Standard 


db7 


76.95 


31.53 


71.95 


77.78 


71.53 


65.95 


Low- 


deviation 


sym8 


71.67 


29.86 


76.53 


75.70 


75.14 


65.90 


frequency 




coif5 


72.78 


31.53 


79.03 


75.00 


76.39 


66.95 


band 




db6 


71.11 


31.53 


71.95 


68.06 


77.36 


64.00 




Total average 


db7 


73.61 


32.09 


77.22 


72.09 


75.98 


66.20 




power 


sym8 


74.03 


31.25 


76.53 


73.34 


73.48 


65.73 






coif5 


76.67 


30.56 


76.53 


73.89 


76.12 


66.75 






db6 


57.50 


27.50 


61.25 


60.84 


64.59 


54.34 




Standard 


db7 


59.17 


24.17 


56.12 


60.00 


58.06 


51.50 


High- 


deviation 


sym8 


72.22 


26.81 


65.28 


62.78 


68.19 


59.06 


frequency 




coif5 


65.56 


25.84 


61.81 


68.75 


61.67 


56.73 


band 




db6 


57.50 


25.14 


62.50 


62.78 


62.78 


54.14 


(0.12-0.49) Hz 


Total average 


db7 


56.39 


24.45 


59.03 


58.47 


57.78 


51.22 




power 


sym8 


71.39 


28.20 


66.81 


63.20 


63.61 


58.64 






coif5 


61.95 


25.98 


60.42 


66.53 


64.45 


55.87 






db6 


71.53 


26.25 


75.14 


62.92 


68.89 


60.95 


Ratio 


HF/LF 


db7 

sym8 


72.09 
64.03 


27.36 
27.36 


66.81 
71.53 


67.50 
73.06 


70.00 
74.59 


60.75 
62.11 






coif5 


69.03 


27.64 


73.61 


74.45 


76.11 


64.17 






db6 


86.53 


31.81 


73.61 


65.56 


72.08 


65.92 


Sum 


LF+HF 


db7 

sym8 


82.50 
76.12 


31.95 
30.70 


74.45 
76.39 


67.50 
70.83 


72.37 
78.34 


65.75 
66.48 






coif5 


88.89 


30.42 


75.28 


72.36 


72.09 


67.81 



the feature vector. Larger feature vectors result in a poor 
classification rate for KNN. Therefore, the optimal value 
of the feature vector is critical for achieving good classi- 
fication accuracy 34 '. Normal subjects have high variability 
of emotional perception across a large population pool, but 
KNN provided the maximum classification rate of 75% or 
higher for most of the classes. The LDA classifier is simple 
to use, has fewer computational requirements, and provides 
good results for several classification applications 35 '. How- 
ever, LDA is not optimal for nonlinear EEG data due to its 
linear nature. The greatest limitation of LDA is that it only 
allows linear or quadratic relationships between the input 
and output 33 ' 34 '. 

In a study by Abdallah et al., audio-visual stimuli (film 
clips) were used to evoke the three most common emotions 
in a group of subjects: pleasant, unpleasant, and calm 23 '. In 
their study, the maximum mean classification rate achieved 
using two physiological signals (HRV and GSR) was 80.2%. 
More recently, a study classified six basic emotions using 
ECG signals and achieved a maximum mean classification 
rate of 61.4% using visual stimuli 24 '. In another study, mul- 
tiple physiological signals (BVP, EMG, and RR) were used 
to classify the visual stimuli-induced emotional states into 
three types: pleasure, non-pleasure, and neutral 36 '. How- 
ever, although visual stimuli have been mainly used to in- 
duce emotions in previous studies, they achieve a low clas- 
sification rate compared to audio-visual stimuli. For HRV 



signals, no study has yet analyzed the individual frequency 
bands for classifying emotions with DWT Localization of 
the specific frequency range in ECG signals is important for 
reducing the computation time and complexity of emotion 
assessment. Furthermore, there is currently no universal 
emotion database, and many studies have focused on devel- 
oping unique databases for emotion classification. 

In this study, we achieved maximum average classifica- 
tion rates of 69.754% and 67.808% for classifying five dif- 
ferent emotional states using KNN and LDA, respectively. 
For both classifiers, coif'5 provided the maximum mean 
classification rate. The mean classification rates of the dif- 
ferent wavelet functions did not vary drastically in the pres- 
ent study. We found that LF band features provided more 
useful information for distinguishing emotions than HF, 
LF+HF, and frequency band ratios. Compared to previous 
studies, the method investigated in this study used efficient 
emotional stimuli to induce different emotions and found 
that a simple signal processing and classification method 
is highly useful for reducing the computational complexity 
and time. 

Based on our current understanding, mapping the dis- 
crete emotions on a two-dimensional (valance -arousal) 
plane is complex. This is because the emotional percep- 
tion of each subject varies from other subjects and each 
subject has a diverse emotional experience over time when 
confronted with the same emotional stimuli. In addition, 
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Table 4. Averaged classification accuracy of emotions using LDA 



Frequency 


Statistical 


Wavelet 






Emotions 






Average 


band 


Features 


Disgust 


Sad 


Happy 


Fear 


Neutral 


accuracy (%) 






db6 


68.48 


48.75 


71.81 


70.70 


71.11 


66.17 




Standard 


db7 


75.14 


47.08 


74.03 


75.83 


71.25 


68.66 


Low- 


deviation 


sym8 


76.11 


48.33 


72.78 


74.86 


76.53 


69.72 


frequency 




coif5 


74.31 


48.75 


74.17 


74.31 


77.23 


69.75 


band 




db6 


65.56 


48.75 


71.67 


73.33 


67.50 


65.36 




Total average 


db7 


69.73 


46.39 


70.56 


72.64 


68.47 


65.56 




power 


sym8 


71.11 


48.89 


72.64 


72.50 


72.22 


67.47 






coif5 


71.67 


50.28 


71.67 


73.20 


71.39 


67.64 






db6 


57.36 


43.89 


64.17 


55.00 


63.20 


56.72 




Standard 


db7 


55.56 


43.75 


60.56 


57.50 


59.17 


55.31 


High- 


deviation 


sym8 


59.86 


42.50 


61.39 


61.53 


63.75 


57.81 


frequency 




coif5 


58.89 


43.75 


59.45 


61.81 


67.22 


58.22 


band 




db6 


55.70 


42.78 


60.00 


53.34 


62.36 


54.84 


(0.12-0.49) Hz 


Total average 


db7 


55.70 


41.67 


56.39 


56.95 


57.50 


53.64 




power 


sym8 


56.95 


39.59 


58.06 


57.92 


64.03 


55.31 






coif5 


56.39 


44.44 


57.78 


56.53 


61.11 


55.25 






db6 


73.89 


43.62 


70.97 


66.67 


78.62 


66.75 


Ratio 


HF/LF 


db7 
sym8 


74.03 
75.28 


47.78 
44.31 


69.03 
72.09 


70.70 
71.67 


73.61 
71.53 


67.03 
66.98 






coif5 


77.92 


45.56 


71.53 


74.58 


72.36 


68.39 






db6 


63.75 


47.23 


73.89 


70.84 


66.67 


64.48 


Sum 


LF+HF 


db7 
sym8 


63.06 
69.86 


46.67 
48.47 


70.97 
74.72 


72.50 
69.86 


69.31 
68.48 


64.50 
66.28 






coif5 


65.70 


49.31 


72.64 


71.81 


72.22 


66.34 



the estimation of physiological activities through multiple 
physiological signals may provide good results for emotion 
classification. However, the handling of multiple single fea- 
tures in a signal processing method increases the computa- 
tion time and complexity. Therefore, the search continues 
for one salient physiological signal which can efficiently 
classify multiple emotions among multiple physiological 
signals. 

We used audio-visual stimuli to induce the five most 
dominant emotions in 20 subjects in order to assess changes 
in their emotional state using ECG signals. The ECG sig- 
nals were preprocessed using a Butterworth 3rd order filter 
and DWT was used to extract the statistical features from 
four different wavelet functions. The extracted features 
were mapped into corresponding emotions using KNN and 
LDA classifiers. Using this approach, we extracted standard 
deviation and average frequency band power from LF and 
HF frequency bands. Based on these results, we found that 
the sum of the LF and HF frequency band power and stan- 
dard deviation performed better than other statistical fea- 
tures for classifying emotions based on gender. 

A maximum classification rate of 88.89% was achieved 
for disgust in males using KNN. The experimental results 
also indicated that the KNN classifier performed better than 
LDA at classifying most of the emotions. Amongst the five 
different emotions, sadness achieved the lowest emotion 
classification rate compared to the other emotions, indicat- 



ing that the stimulus used in this experiment to evoke sad- 
ness was not efficient. The selection of wavelet function, 
emotional stimuli, and statistical feature computation plays 
a major role in achieving a good emotion classification rate. 
In a study by Cong et al., four physiological signals (ECG, 
EMG, SC, and respiration) were used to classify four emo- 
tional states, achieving a maximum mean classification rate 
of 76% 38 '. Real-time emotion recognition using ECG signals 
was also developed by Kanlaya et al. and achieved a mean 
classification rate of 61.44% 24 l Another study achieved 
a maximum mean classification rate of 97.8% when two 
emotions (happiness and sadness) were classified 39 ^. The 
method investigagted in this study used a single physiologi- 
cal signal to classify five emotions and achieved a mean 
classification rate of 66.48% using simple classfiers. This 
analysis was performed "offline" using the MATLAB sig- 
nal processing toolbox. In future studies, we plan to utilize 
ECG signal morphology features, such as QRS complexity, 
R-R peak detection, S-T time interval, R wave detection, 
R-R peak amplitude, and QRS slope deviation as statistical 
features to examine the correlation between ECG signals 
and specific emotions. 
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